The reasoning: In the current frame, you are facing a birch wood block, which is part of a tree. The task is to chop a tree, and since you haven't obtained the target material (birch wood) yet, the next action is to attack the block in front of you to start gathering the wood. There's no need to adjust the camera as the target is already centered in your view, next action: attack, and next frame: 